Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 416 |
| Missing cells | 908 |
| Missing cells (%) | 8.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 84.6 KiB |
| Average record size in memory | 208.3 B |
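The headline figures above can be cross-checked against each other. The sketch below recomputes the missing-cell percentage from the reported counts. The per-column missing counts are copied from this report's own "Missing" warnings (ADDRESSLINE2, POSTALCODE, TERRITORY). The variable names are illustrative and not part of pandas-profiling.

```python
# Figures copied from the "Dataset statistics" table of this report.
n_variables = 26
n_observations = 416
missing_cells = 908

# Missing counts per column, copied from the report's "Missing" warnings.
missing_by_column = {"ADDRESSLINE2": 416, "POSTALCODE": 76, "TERRITORY": 416}

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells from warnings: {sum(missing_by_column.values())}")
print(f"missing cells (%): {missing_pct:.1f}")
```

The three columns listed in the warnings account for all missing cells, so no other column contains missing values.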
Variable types
| CAT | 16 |
|---|---|
| NUM | 8 |
| UNSUPPORTED | 2 |
STATE has constant value "CA" | Constant |
COUNTRY has constant value "USA" | Constant |
PRODUCTCODE has a high cardinality: 109 distinct values | High cardinality |
MONTH_ID is highly correlated with QTR_ID | High correlation |
QTR_ID is highly correlated with MONTH_ID | High correlation |
YEAR_ID is highly correlated with ORDERNUMBER | High correlation |
ORDERNUMBER is highly correlated with YEAR_ID | High correlation |
STATUS is highly correlated with ORDERDATE | High correlation |
ORDERDATE is highly correlated with STATUS and 9 other fields | High correlation |
QTR_ID is highly correlated with ORDERDATE | High correlation |
YEAR_ID is highly correlated with ORDERDATE | High correlation |
CUSTOMERNAME is highly correlated with ORDERDATE and 6 other fields | High correlation |
PHONE is highly correlated with ORDERDATE and 6 other fields | High correlation |
ADDRESSLINE1 is highly correlated with ORDERDATE and 6 other fields | High correlation |
CITY is highly correlated with ORDERDATE and 6 other fields | High correlation |
POSTALCODE is highly correlated with ORDERDATE and 4 other fields | High correlation |
CONTACTLASTNAME is highly correlated with ORDERDATE and 4 other fields | High correlation |
CONTACTFIRSTNAME is highly correlated with ORDERDATE and 4 other fields | High correlation |
ADDRESSLINE2 has 416 (100.0%) missing values | Missing |
POSTALCODE has 76 (18.3%) missing values | Missing |
TERRITORY has 416 (100.0%) missing values | Missing |
PRODUCTCODE is uniformly distributed | Uniform |
df_index has unique values | Unique |
ADDRESSLINE2 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
TERRITORY is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
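Some of the high-correlation warnings involve columns that appear to be derived from one another rather than independent measurements. As a minimal sketch, assuming QTR_ID is the calendar quarter of MONTH_ID (the report does not state this, but the sample rows it shows are consistent with it), the relationship reduces to one integer formula. `quarter_of` is a hypothetical helper, and the pairs are the distinct (MONTH_ID, QTR_ID) combinations visible in the report's first and last rows.

```python
def quarter_of(month):
    """Map a calendar month (1-12) to its quarter (1-4)."""
    return (month - 1) // 3 + 1

# Distinct (MONTH_ID, QTR_ID) pairs seen in the report's sample rows.
sample_pairs = [(1, 1), (2, 1), (4, 2), (5, 2), (7, 3), (8, 3),
                (10, 4), (11, 4), (12, 4)]

mismatches = [(m, q) for m, q in sample_pairs if quarter_of(m) != q]
print("mismatching pairs:", mismatches)
```

If the assumption holds for the full dataset, one of the two columns is redundant for modelling purposes.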
Reproduction
| Analysis started | 2020-12-12 09:57:51.156974 |
|---|---|
| Analysis finished | 2020-12-12 09:58:46.118063 |
| Duration | 54.96 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
df_index
Real number (ℝ≥0)
| Distinct | 416 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1341.96875 |
|---|---|
| Minimum | 3 |
| Maximum | 2807 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 125.75 |
| Q1 | 676.75 |
| median | 1348.5 |
| Q3 | 2027.25 |
| 95-th percentile | 2619 |
| Maximum | 2807 |
| Range | 2804 |
| Interquartile range (IQR) | 1350.5 |
Descriptive statistics
| Standard deviation | 792.1587047 |
|---|---|
| Coefficient of variation (CV) | 0.59029594 |
| Kurtosis | -1.164334936 |
| Mean | 1341.96875 |
| Median Absolute Deviation (MAD) | 675.5 |
| Skewness | 0.053092298 |
| Sum | 558259 |
| Variance | 627515.4135 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 1022 | 1 | 0.2% | |
| 2042 | 1 | 0.2% | |
| 667 | 1 | 0.2% | |
| 319 | 1 | 0.2% | |
| 1345 | 1 | 0.2% | |
| 2392 | 1 | 0.2% | |
| 1352 | 1 | 0.2% | |
| 1353 | 1 | 0.2% | |
| 330 | 1 | 0.2% | |
| 289 | 1 | 0.2% | |
| Other values (406) | 406 | 97.6% |
| Value | Count | Frequency (%) | |
| 3 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| 5 | 1 | 0.2% | |
| 8 | 1 | 0.2% | |
| 29 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 2807 | 1 | 0.2% | |
| 2800 | 1 | 0.2% | |
| 2795 | 1 | 0.2% | |
| 2780 | 1 | 0.2% | |
| 2779 | 1 | 0.2% |
ORDERNUMBER
Real number (ℝ≥0)
| Distinct | 45 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10253.22356 |
|---|---|
| Minimum | 10111 |
| Maximum | 10421 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 10111 |
|---|---|
| 5-th percentile | 10135 |
| Q1 | 10160 |
| median | 10226 |
| Q3 | 10367 |
| 95-th percentile | 10400 |
| Maximum | 10421 |
| Range | 310 |
| Interquartile range (IQR) | 207 |
Descriptive statistics
| Standard deviation | 96.86876241 |
|---|---|
| Coefficient of variation (CV) | 0.009447639746 |
| Kurtosis | -1.452307836 |
| Mean | 10253.22356 |
| Median Absolute Deviation (MAD) | 81 |
| Skewness | 0.2722781276 |
| Sum | 4265341 |
| Variance | 9383.55713 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 10168 | 18 | 4.3% | |
| 10222 | 18 | 4.3% | |
| 10159 | 18 | 4.3% | |
| 10312 | 17 | 4.1% | |
| 10135 | 17 | 4.1% | |
| 10182 | 17 | 4.1% | |
| 10390 | 16 | 3.8% | |
| 10142 | 16 | 3.8% | |
| 10145 | 16 | 3.8% | |
| 10229 | 14 | 3.4% | |
| Other values (35) | 249 | 59.9% |
| Value | Count | Frequency (%) | |
| 10111 | 6 | 1.4% | |
| 10113 | 4 | 1.0% | |
| 10135 | 17 | 4.1% | |
| 10140 | 11 | 2.6% | |
| 10142 | 16 | 3.8% |
| Value | Count | Frequency (%) | |
| 10421 | 2 | 0.5% | |
| 10407 | 12 | 2.9% | |
| 10400 | 9 | 2.2% | |
| 10396 | 8 | 1.9% | |
| 10390 | 16 | 3.8% |
QUANTITYORDERED
Real number (ℝ≥0)
| Distinct | 38 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.01201923 |
|---|---|
| Minimum | 6 |
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 28 |
| median | 36 |
| Q3 | 44 |
| 95-th percentile | 50 |
| Maximum | 76 |
| Range | 70 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.916147219 |
|---|---|
| Coefficient of variation (CV) | 0.2753566012 |
| Kurtosis | 0.4791128201 |
| Mean | 36.01201923 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.3493986235 |
| Sum | 14981 |
| Variance | 98.32997567 |
| Monotonicity | Not monotonic |
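Several of the QUANTITYORDERED statistics follow directly from others in the same tables. The sketch below recomputes them from the reported values, as a sanity check rather than a reproduction of how pandas-profiling computes them internally. The variable names are illustrative.

```python
import math

# Values copied from the QUANTITYORDERED tables in this report.
n_observations = 416
total = 14981
minimum, maximum = 6, 76
q1, q3 = 28, 44
std_dev = 9.916147219

mean = total / n_observations     # Mean = Sum / number of observations
value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1
cv = std_dev / mean               # CV = standard deviation / mean
variance = std_dev ** 2           # Variance = standard deviation squared

print(f"mean={mean:.8f} range={value_range} IQR={iqr}")
print(f"CV={cv:.10f} variance={variance:.8f}")
print("CV matches report:", math.isclose(cv, 0.2753566012, rel_tol=1e-6))
```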
| Value | Count | Frequency (%) | |
| 31 | 21 | 5.0% | |
| 33 | 21 | 5.0% | |
| 49 | 21 | 5.0% | |
| 36 | 19 | 4.6% | |
| 48 | 19 | 4.6% | |
| 39 | 17 | 4.1% | |
| 43 | 16 | 3.8% | |
| 37 | 16 | 3.8% | |
| 38 | 16 | 3.8% | |
| 25 | 15 | 3.6% | |
| Other values (28) | 235 | 56.5% |
| Value | Count | Frequency (%) | |
| 6 | 1 | 0.2% | |
| 13 | 1 | 0.2% | |
| 20 | 15 | 3.6% | |
| 21 | 9 | 2.2% | |
| 22 | 9 | 2.2% |
| Value | Count | Frequency (%) | |
| 76 | 2 | 0.5% | |
| 66 | 1 | 0.2% | |
| 64 | 2 | 0.5% | |
| 59 | 2 | 0.5% | |
| 58 | 1 | 0.2% |
PRICEEACH
Real number (ℝ≥0)
| Distinct | 218 |
|---|---|
| Distinct (%) | 52.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.85576923 |
|---|---|
| Minimum | 27.22 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 27.22 |
|---|---|
| 5-th percentile | 38.98 |
| Q1 | 66.9375 |
| median | 94.705 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 72.78 |
| Interquartile range (IQR) | 33.0625 |
Descriptive statistics
| Standard deviation | 21.19091213 |
|---|---|
| Coefficient of variation (CV) | 0.2557566278 |
| Kurtosis | -0.4574761942 |
| Mean | 82.85576923 |
| Median Absolute Deviation (MAD) | 5.295 |
| Skewness | -0.9476293672 |
| Sum | 34468 |
| Variance | 449.054757 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 100 | 182 | 43.8% | |
| 40.25 | 3 | 0.7% | |
| 90.57 | 3 | 0.7% | |
| 64.33 | 2 | 0.5% | |
| 61.15 | 2 | 0.5% | |
| 51.93 | 2 | 0.5% | |
| 61.99 | 2 | 0.5% | |
| 98.65 | 2 | 0.5% | |
| 43.27 | 2 | 0.5% | |
| 36.29 | 2 | 0.5% | |
| Other values (208) | 214 | 51.4% |
| Value | Count | Frequency (%) | |
| 27.22 | 1 | 0.2% | |
| 29.54 | 1 | 0.2% | |
| 30.59 | 1 | 0.2% | |
| 32.88 | 1 | 0.2% | |
| 33.19 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 100 | 182 | 43.8% | |
| 99.66 | 1 | 0.2% | |
| 99.58 | 1 | 0.2% | |
| 99.55 | 1 | 0.2% | |
| 99.21 | 1 | 0.2% |
ORDERLINENUMBER
Real number (ℝ≥0)
| Distinct | 18 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.663461538 |
|---|---|
| Minimum | 1 |
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 10 |
| 95-th percentile | 15 |
| Maximum | 18 |
| Range | 17 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.412164077 |
|---|---|
| Coefficient of variation (CV) | 0.6621429495 |
| Kurtosis | -0.551924671 |
| Mean | 6.663461538 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.6032707571 |
| Sum | 2772 |
| Variance | 19.46719184 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 45 | 10.8% | |
| 2 | 42 | 10.1% | |
| 3 | 39 | 9.4% | |
| 4 | 37 | 8.9% | |
| 5 | 33 | 7.9% | |
| 6 | 31 | 7.5% | |
| 7 | 29 | 7.0% | |
| 8 | 27 | 6.5% | |
| 9 | 24 | 5.8% | |
| 10 | 22 | 5.3% | |
| Other values (8) | 87 | 20.9% |
| Value | Count | Frequency (%) | |
| 1 | 45 | 10.8% | |
| 2 | 42 | 10.1% | |
| 3 | 39 | 9.4% | |
| 4 | 37 | 8.9% | |
| 5 | 33 | 7.9% |
| Value | Count | Frequency (%) | |
| 18 | 3 | 0.7% | |
| 17 | 6 | 1.4% | |
| 16 | 9 | 2.2% | |
| 15 | 9 | 2.2% | |
| 14 | 11 | 2.6% |
SALES
Real number (ℝ≥0)
| Distinct | 415 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3619.091899 |
|---|---|
| Minimum | 541.14 |
| Maximum | 14082.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 541.14 |
|---|---|
| 5-th percentile | 1217.6775 |
| Q1 | 2203.075 |
| median | 3244.97 |
| Q3 | 4623.255 |
| 95-th percentile | 7351.45 |
| Maximum | 14082.8 |
| Range | 13541.66 |
| Interquartile range (IQR) | 2420.18 |
Descriptive statistics
| Standard deviation | 1945.958755 |
|---|---|
| Coefficient of variation (CV) | 0.5376925508 |
| Kurtosis | 2.862834459 |
| Mean | 3619.091899 |
| Median Absolute Deviation (MAD) | 1148.97 |
| Skewness | 1.316855096 |
| Sum | 1505542.23 |
| Variance | 3786755.475 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4181.44 | 2 | 0.5% | |
| 2718.72 | 1 | 0.2% | |
| 1847 | 1 | 0.2% | |
| 2838.81 | 1 | 0.2% | |
| 6763.05 | 1 | 0.2% | |
| 3091.68 | 1 | 0.2% | |
| 3025.92 | 1 | 0.2% | |
| 3958.5 | 1 | 0.2% | |
| 3662.52 | 1 | 0.2% | |
| 2760.94 | 1 | 0.2% | |
| Other values (405) | 405 | 97.4% |
| Value | Count | Frequency (%) | |
| 541.14 | 1 | 0.2% | |
| 717.4 | 1 | 0.2% | |
| 834.67 | 1 | 0.2% | |
| 846.51 | 1 | 0.2% | |
| 856.52 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 14082.8 | 1 | 0.2% | |
| 11623.7 | 1 | 0.2% | |
| 11336.7 | 1 | 0.2% | |
| 9661.44 | 1 | 0.2% | |
| 9470.94 | 1 | 0.2% |
ORDERDATE
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| 2/17/2005 0:00 | 22 | 5.3% | |
| 10/10/2003 0:00 | 18 | 4.3% | |
| 2/19/2004 0:00 | 18 | 4.3% | |
| 10/28/2003 0:00 | 18 | 4.3% | |
| 11/12/2003 0:00 | 17 | 4.1% | |
| 7/2/2003 0:00 | 17 | 4.1% | |
| 10/21/2004 0:00 | 17 | 4.1% | |
| 3/4/2005 0:00 | 16 | 3.8% | |
| 8/8/2003 0:00 | 16 | 3.8% | |
| 8/25/2003 0:00 | 16 | 3.8% | |
| Other values (33) | 241 | 57.9% |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.7% |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 14.08894231 |
| Min length | 13 |
STATUS
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| Shipped | 389 | 93.5% | |
| Resolved | 13 | 3.1% | |
| On Hold | 12 | 2.9% | |
| In Process | 2 | 0.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.045673077 |
| Min length | 7 |
QTR_ID
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 158 | 38.0% | |
| 4 | 121 | 29.1% | |
| 3 | 95 | 22.8% | |
| 2 | 42 | 10.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
MONTH_ID
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.052884615 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 3.700484025 |
|---|---|
| Coefficient of variation (CV) | 0.6113587587 |
| Kurtosis | -1.511574605 |
| Mean | 6.052884615 |
| Median Absolute Deviation (MAD) | 3.5 |
| Skewness | 0.03422829123 |
| Sum | 2518 |
| Variance | 13.69358202 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 10 | 71 | 17.1% | |
| 2 | 58 | 13.9% | |
| 1 | 52 | 12.5% | |
| 3 | 48 | 11.5% | |
| 8 | 45 | 10.8% | |
| 7 | 39 | 9.4% | |
| 11 | 30 | 7.2% | |
| 4 | 21 | 5.0% | |
| 12 | 20 | 4.8% | |
| 5 | 16 | 3.8% | |
| Other values (2) | 16 | 3.8% |
| Value | Count | Frequency (%) | |
| 1 | 52 | 12.5% | |
| 2 | 58 | 13.9% | |
| 3 | 48 | 11.5% | |
| 4 | 21 | 5.0% | |
| 5 | 16 | 3.8% |
| Value | Count | Frequency (%) | |
| 12 | 20 | 4.8% | |
| 11 | 30 | 7.2% | |
| 10 | 71 | 17.1% | |
| 9 | 11 | 2.6% | |
| 8 | 45 | 10.8% |
YEAR_ID
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| 2003 | 163 | 39.2% | |
| 2004 | 143 | 34.4% | |
| 2005 | 110 | 26.4% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
PRODUCTLINE
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| Vintage Cars | 125 | 30.0% | |
| Classic Cars | 116 | 27.9% | |
| Motorcycles | 54 | 13.0% | |
| Trucks and Buses | 52 | 12.5% | |
| Planes | 37 | 8.9% | |
| Ships | 24 | 5.8% | |
| Trains | 8 | 1.9% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 11.31730769 |
| Min length | 5 |
MSRP
Real number (ℝ≥0)
| Distinct | 80 |
|---|---|
| Distinct (%) | 19.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.66105769 |
|---|---|
| Minimum | 33 |
| Maximum | 214 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 66 |
| median | 97 |
| Q3 | 122 |
| 95-th percentile | 170.75 |
| Maximum | 214 |
| Range | 181 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 41.33318073 |
|---|---|
| Coefficient of variation (CV) | 0.4147375282 |
| Kurtosis | -0.02108227963 |
| Mean | 99.66105769 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 0.6364308194 |
| Sum | 41459 |
| Variance | 1708.431829 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 99 | 17 | 4.1% | |
| 60 | 16 | 3.8% | |
| 118 | 15 | 3.6% | |
| 136 | 13 | 3.1% | |
| 62 | 12 | 2.9% | |
| 127 | 11 | 2.6% | |
| 102 | 10 | 2.4% | |
| 101 | 10 | 2.4% | |
| 50 | 10 | 2.4% | |
| 80 | 9 | 2.2% | |
| Other values (70) | 293 | 70.4% |
| Value | Count | Frequency (%) | |
| 33 | 5 | 1.2% | |
| 35 | 5 | 1.2% | |
| 37 | 4 | 1.0% | |
| 40 | 4 | 1.0% | |
| 41 | 5 | 1.2% |
| Value | Count | Frequency (%) | |
| 214 | 6 | 1.4% | |
| 207 | 4 | 1.0% | |
| 194 | 2 | 0.5% | |
| 193 | 5 | 1.2% | |
| 173 | 4 | 1.0% |
PRODUCTCODE
Categorical
| Distinct | 109 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| S18_3320 | 9 | 2.2% | |
| S18_1367 | 7 | 1.7% | |
| S18_4668 | 7 | 1.7% | |
| S24_4258 | 7 | 1.7% | |
| S18_1097 | 7 | 1.7% | |
| S18_2795 | 7 | 1.7% | |
| S18_2248 | 6 | 1.4% | |
| S18_3136 | 6 | 1.4% | |
| S12_1666 | 6 | 1.4% | |
| S10_1949 | 6 | 1.4% | |
| Other values (99) | 348 | 83.7% |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.2% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.084134615 |
| Min length | 8 |
CUSTOMERNAME
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| Mini Gifts Distributors Ltd. | 180 | 43.3% | |
| Corporate Gift Ideas Co. | 41 | 9.9% | |
| The Sharp Gifts Warehouse | 40 | 9.6% | |
| Technics Stores Inc. | 34 | 8.2% | |
| Toys4GrownUps.com | 30 | 7.2% | |
| Collectable Mini Designs Co. | 25 | 6.0% | |
| Mini Wheels Co. | 21 | 5.0% | |
| Signal Collectibles Ltd. | 15 | 3.6% | |
| Men 'R' US Retailers, Ltd. | 14 | 3.4% | |
| West Coast Collectables Co. | 13 | 3.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 28 |
|---|---|
| Median length | 27 |
| Mean length | 24.89182692 |
| Min length | 15 |
PHONE
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| 4155551450 | 180 | 43.3% | |
| 6505551386 | 41 | 9.9% | |
| 4085553659 | 40 | 9.6% | |
| 6505556809 | 34 | 8.2% | |
| 6265557265 | 30 | 7.2% | |
| 7605558146 | 25 | 6.0% | |
| 6505555787 | 21 | 5.0% | |
| 4155554312 | 15 | 3.6% | |
| 2155554369 | 14 | 3.4% | |
| 3105553722 | 13 | 3.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
ADDRESSLINE1
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| 5677 Strong St. | 180 | 43.3% | |
| 7734 Strong St. | 41 | 9.9% | |
| 3086 Ingle Ln. | 40 | 9.6% | |
| 9408 Furth Circle | 34 | 8.2% | |
| 78934 Hillside Dr. | 30 | 7.2% | |
| 361 Furth Circle | 25 | 6.0% | |
| 5557 North Pendale Street | 21 | 5.0% | |
| 2793 Furth Circle | 15 | 3.6% | |
| 6047 Douglas Av. | 14 | 3.4% | |
| 3675 Furth Circle | 13 | 3.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 25 |
|---|---|
| Median length | 15 |
| Mean length | 16.02403846 |
| Min length | 14 |
CITY
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| San Rafael | 180 | 43.3% | |
| San Francisco | 62 | 14.9% | |
| San Jose | 40 | 9.6% | |
| Burlingame | 34 | 8.2% | |
| Pasadena | 30 | 7.2% | |
| San Diego | 25 | 6.0% | |
| Brisbane | 15 | 3.6% | |
| Los Angeles | 14 | 3.4% | |
| Burbank | 13 | 3.1% | |
| Glendale | 3 | 0.7% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.903846154 |
| Min length | 7 |
STATE
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| CA | 416 | 100.0% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
POSTALCODE
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 76 |
| Missing (%) | 18.3% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| 97562 | 180 | 43.3% | |
| 94217 | 89 | 21.4% | |
| 90003 | 30 | 7.2% | |
| 91217 | 25 | 6.0% | |
| 94019 | 13 | 3.1% | |
| 92561 | 3 | 0.7% | |
| (Missing) | 76 | 18.3% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.634615385 |
| Min length | 3 |
COUNTRY
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| USA | 416 | 100.0% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
CONTACTLASTNAME
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| Nelson | 180 | 43.3% | |
| Brown | 41 | 9.9% | |
| Frick | 40 | 9.6% | |
| Thompson | 38 | 9.1% | |
| Hirano | 34 | 8.2% | |
| Young | 33 | 7.9% | |
| Murphy | 21 | 5.0% | |
| Taylor | 15 | 3.6% | |
| Chandler | 14 | 3.4% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 5.975961538 |
| Min length | 5 |
CONTACTFIRSTNAME
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| Valarie | 205 | 49.3% | |
| Julie | 92 | 22.1% | |
| Sue | 55 | 13.2% | |
| Juri | 34 | 8.2% | |
| Michael | 14 | 3.4% | |
| Steve | 13 | 3.1% | |
| Leslie | 3 | 0.7% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 5.713942308 |
| Min length | 3 |
DEALSIZE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| Value | Count | Frequency (%) | |
| Medium | 208 | 50.0% | |
| Small | 181 | 43.5% | |
| Large | 27 | 6.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 5.5 |
| Mean length | 5.5 |
| Min length | 5 |
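The fractional median length of 5.5 for DEALSIZE may look odd for a string column. It follows from the category counts: exactly half of the rows hold the six-character value "Medium" and the other half hold five-character values. The sketch below recomputes the frequencies and length statistics from the counts reported above, with illustrative variable names.

```python
from statistics import mean, median

# Counts copied from the DEALSIZE frequency table in this report.
counts = {"Medium": 208, "Small": 181, "Large": 27}
n_rows = sum(counts.values())

# Frequency (%) = count / number of rows, rounded to one decimal place.
frequency_pct = {label: round(100 * c / n_rows, 1) for label, c in counts.items()}

# One string length per row, so mean and median are taken over rows.
lengths = sorted(len(label) for label, c in counts.items() for _ in range(c))
mean_length = mean(lengths)
median_length = median(lengths)

print(frequency_pct)
print(f"mean length={mean_length} median length={median_length}")
```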
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
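As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations (the common 1/n factors cancel). `pearson_r` is an illustrative name, not the routine the report uses. The inputs are the MONTH_ID and QTR_ID values from the ten first rows shown in this report, so the result describes that sample only.

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# MONTH_ID and QTR_ID from the first ten rows of the report.
month_id = [8, 10, 10, 12, 7, 1, 10, 11, 12, 2]
qtr_id = [3, 4, 4, 4, 3, 1, 4, 4, 4, 1]

r = pearson_r(month_id, qtr_id)
# Shifting and rescaling one variable (positive scale) leaves r unchanged.
r_rescaled = pearson_r([2 * m + 5 for m in month_id], qtr_id)

print(f"r = {r:.4f}, after rescaling = {r_rescaled:.4f}")
```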
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
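A minimal sketch of this definition: rank both variables (tied values share the average of their positions, which is an assumption about tie handling), then apply Pearson's formula to the ranks. The helper names are illustrative. The QUANTITYORDERED and SALES values come from the ten first rows shown in this report, and the cubic series is a made-up example of a nonlinear monotonic relationship.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values get the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared_rank
        start = end + 1
    return ranks

def pearson_r(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# First ten rows of the report.
quantity = [45, 49, 36, 22, 37, 35, 48, 26, 32, 36]
sales = [3746.70, 5205.27, 3479.76, 2168.54, 7374.10,
         6075.30, 11623.70, 3003.00, 5691.84, 8254.80]
rho = spearman_rho(quantity, sales)

# Illustrative monotonic but nonlinear relationship.
cubic_x = [1, 2, 3, 4, 5]
cubic_y = [x ** 3 for x in cubic_x]
print(f"rho(sample) = {rho:.4f}")
print(f"cubic: rho = {spearman_rho(cubic_x, cubic_y):.4f}, r = {pearson_r(cubic_x, cubic_y):.4f}")
```

On the cubic series ρ reaches its maximum while r stays below 1, which is the difference the paragraph above describes.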
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
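The pair-counting definition can be sketched directly. The version below is the simple form described above (often called τ-a); implementations used in practice may apply a tie correction, so its output can differ from the report's matrix when ties are present. `kendall_tau_a` is an illustrative name and the input series are made up.

```python
import math
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie; it counts toward neither group
    return (concordant - discordant) / len(pairs)

increasing = [1, 2, 3, 4]
print(kendall_tau_a(increasing, increasing))
print(kendall_tau_a(increasing, increasing[::-1]))
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```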
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. The report uses a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | df_index | ORDERNUMBER | QUANTITYORDERED | PRICEEACH | ORDERLINENUMBER | SALES | ORDERDATE | STATUS | QTR_ID | MONTH_ID | YEAR_ID | PRODUCTLINE | MSRP | PRODUCTCODE | CUSTOMERNAME | PHONE | ADDRESSLINE1 | ADDRESSLINE2 | CITY | STATE | POSTALCODE | COUNTRY | TERRITORY | CONTACTLASTNAME | CONTACTFIRSTNAME | DEALSIZE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3 | 10145 | 45 | 83.26 | 6 | 3746.70 | 8/25/2003 0:00 | Shipped | 3 | 8 | 2003 | Motorcycles | 95 | S10_1678 | Toys4GrownUps.com | 6265557265 | 78934 Hillside Dr. | NaN | Pasadena | CA | 90003 | USA | NaN | Young | Julie | Medium |
| 1 | 4 | 10159 | 49 | 100.00 | 14 | 5205.27 | 10/10/2003 0:00 | Shipped | 4 | 10 | 2003 | Motorcycles | 95 | S10_1678 | Corporate Gift Ideas Co. | 6505551386 | 7734 Strong St. | NaN | San Francisco | CA | NaN | USA | NaN | Brown | Julie | Medium |
| 2 | 5 | 10168 | 36 | 96.66 | 1 | 3479.76 | 10/28/2003 0:00 | Shipped | 4 | 10 | 2003 | Motorcycles | 95 | S10_1678 | Technics Stores Inc. | 6505556809 | 9408 Furth Circle | NaN | Burlingame | CA | 94217 | USA | NaN | Hirano | Juri | Medium |
| 3 | 8 | 10201 | 22 | 98.57 | 2 | 2168.54 | 12/1/2003 0:00 | Shipped | 4 | 12 | 2003 | Motorcycles | 95 | S10_1678 | Mini Wheels Co. | 6505555787 | 5557 North Pendale Street | NaN | San Francisco | CA | NaN | USA | NaN | Murphy | Julie | Small |
| 4 | 29 | 10140 | 37 | 100.00 | 11 | 7374.10 | 7/24/2003 0:00 | Shipped | 3 | 7 | 2003 | Classic Cars | 214 | S10_1949 | Technics Stores Inc. | 6505556809 | 9408 Furth Circle | NaN | Burlingame | CA | 94217 | USA | NaN | Hirano | Juri | Large |
| 5 | 36 | 10215 | 35 | 100.00 | 3 | 6075.30 | 1/29/2004 0:00 | Shipped | 1 | 1 | 2004 | Classic Cars | 214 | S10_1949 | West Coast Collectables Co. | 3105553722 | 3675 Furth Circle | NaN | Burbank | CA | 94019 | USA | NaN | Thompson | Steve | Medium |
| 6 | 44 | 10312 | 48 | 100.00 | 3 | 11623.70 | 10/21/2004 0:00 | Shipped | 4 | 10 | 2004 | Classic Cars | 214 | S10_1949 | Mini Gifts Distributors Ltd. | 4155551450 | 5677 Strong St. | NaN | San Rafael | CA | 97562 | USA | NaN | Nelson | Valarie | Large |
| 7 | 46 | 10333 | 26 | 100.00 | 3 | 3003.00 | 11/18/2004 0:00 | Shipped | 4 | 11 | 2004 | Classic Cars | 214 | S10_1949 | Mini Wheels Co. | 6505555787 | 5557 North Pendale Street | NaN | San Francisco | CA | NaN | USA | NaN | Murphy | Julie | Medium |
| 8 | 48 | 10357 | 32 | 100.00 | 10 | 5691.84 | 12/10/2004 0:00 | Shipped | 4 | 12 | 2004 | Classic Cars | 214 | S10_1949 | Mini Gifts Distributors Ltd. | 4155551450 | 5677 Strong St. | NaN | San Rafael | CA | 97562 | USA | NaN | Nelson | Valarie | Medium |
| 9 | 50 | 10381 | 36 | 100.00 | 3 | 8254.80 | 2/17/2005 0:00 | Shipped | 1 | 2 | 2005 | Classic Cars | 214 | S10_1949 | Corporate Gift Ideas Co. | 6505551386 | 7734 Strong St. | NaN | San Francisco | CA | NaN | USA | NaN | Brown | Julie | Large |
Last rows
| | df_index | ORDERNUMBER | QUANTITYORDERED | PRICEEACH | ORDERLINENUMBER | SALES | ORDERDATE | STATUS | QTR_ID | MONTH_ID | YEAR_ID | PRODUCTLINE | MSRP | PRODUCTCODE | CUSTOMERNAME | PHONE | ADDRESSLINE1 | ADDRESSLINE2 | CITY | STATE | POSTALCODE | COUNTRY | TERRITORY | CONTACTLASTNAME | CONTACTFIRSTNAME | DEALSIZE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 406 | 2720 | 10142 | 38 | 85.41 | 4 | 3245.58 | 8/8/2003 0:00 | Shipped | 3 | 8 | 2003 | Ships | 99 | S700_3962 | Mini Gifts Distributors Ltd. | 4155551450 | 5677 Strong St. | NaN | San Rafael | CA | 97562 | USA | NaN | Nelson | Valarie | Medium |
| 407 | 2727 | 10222 | 31 | 95.34 | 17 | 2955.54 | 2/19/2004 0:00 | Shipped | 1 | 2 | 2004 | Ships | 99 | S700_3962 | Collectable Mini Designs Co. | 7605558146 | 361 Furth Circle | NaN | San Diego | CA | 91217 | USA | NaN | Thompson | Valarie | Small |
| 408 | 2748 | 10168 | 39 | 82.91 | 17 | 3233.49 | 10/28/2003 0:00 | Shipped | 4 | 10 | 2003 | Planes | 74 | S700_4002 | Technics Stores Inc. | 6505556809 | 9408 Furth Circle | NaN | Burlingame | CA | 94217 | USA | NaN | Hirano | Juri | Medium |
| 409 | 2752 | 10222 | 43 | 74.03 | 2 | 3183.29 | 2/19/2004 0:00 | Shipped | 1 | 2 | 2004 | Planes | 74 | S700_4002 | Collectable Mini Designs Co. | 7605558146 | 361 Furth Circle | NaN | San Diego | CA | 91217 | USA | NaN | Thompson | Valarie | Medium |
| 410 | 2754 | 10250 | 38 | 62.19 | 12 | 2363.22 | 5/11/2004 0:00 | Shipped | 2 | 5 | 2004 | Planes | 74 | S700_4002 | The Sharp Gifts Warehouse | 4085553659 | 3086 Ingle Ln. | NaN | San Jose | CA | 94217 | USA | NaN | Frick | Sue | Small |
| 411 | 2779 | 10209 | 48 | 44.69 | 3 | 2145.12 | 1/9/2004 0:00 | Shipped | 1 | 1 | 2004 | Planes | 49 | S72_1253 | Men 'R' US Retailers, Ltd. | 2155554369 | 6047 Douglas Av. | NaN | Los Angeles | CA | NaN | USA | NaN | Chandler | Michael | Small |
| 412 | 2780 | 10222 | 31 | 45.69 | 7 | 1416.39 | 2/19/2004 0:00 | Shipped | 1 | 2 | 2004 | Planes | 49 | S72_1253 | Collectable Mini Designs Co. | 7605558146 | 361 Furth Circle | NaN | San Diego | CA | 91217 | USA | NaN | Thompson | Valarie | Small |
| 413 | 2795 | 10400 | 20 | 56.12 | 4 | 1122.40 | 4/1/2005 0:00 | Shipped | 2 | 4 | 2005 | Planes | 49 | S72_1253 | The Sharp Gifts Warehouse | 4085553659 | 3086 Ingle Ln. | NaN | San Jose | CA | 94217 | USA | NaN | Frick | Sue | Small |
| 414 | 2800 | 10142 | 39 | 44.23 | 5 | 1724.97 | 8/8/2003 0:00 | Shipped | 3 | 8 | 2003 | Ships | 54 | S72_3212 | Mini Gifts Distributors Ltd. | 4155551450 | 5677 Strong St. | NaN | San Rafael | CA | 97562 | USA | NaN | Nelson | Valarie | Small |
| 415 | 2807 | 10222 | 36 | 63.34 | 18 | 2280.24 | 2/19/2004 0:00 | Shipped | 1 | 2 | 2004 | Ships | 54 | S72_3212 | Collectable Mini Designs Co. | 7605558146 | 361 Furth Circle | NaN | San Diego | CA | 91217 | USA | NaN | Thompson | Valarie | Small |